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Real-time growth rate for general stochastic SIR 
epidemics on nnclustered networks 

Lorenzo Pellis^’*, Simon Spencer^, Thomas House^’^ 


Abstract 

Networks have become an important tool for infectious disease epidemiol¬ 
ogy. Most previous theoretical studies of transmission network models have 
either considered simple Markovian dynamics at the individual level, or have 
focused on the invasion threshold and hnal outcome of the epidemic. Here, 
we provide a general theory for early real-time behaviour of epidemics on 
large conhguration model networks (i.e. static and locally nnclustered), in 
particular focusing on the computation of the Malthusian parameter that 
describes the early exponential epidemic growth. Analytical, numerical and 
Monte-Carlo methods under a wide variety of Markovian and non-Markovian 
assumptions about the infectivity prohle are presented. Numerous examples 
provide explicit quantihcation of the impact of the network structure on the 
temporal dynamics of the spread of infection and provide a benchmark for 
validating results of large scale simulations. 

Keywords: Epidemic, Malthusian parameter, Basic reproduction number, 
Conhguration model. Branching process 
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1. Introduction 

The held of infectious disease epidemiology has benehtted from the use of 
networks both as conceptual tools and as a practical representation of inter- 
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action between the agent s involved in the spread of infections flRansal et ah 


20071: iDanon et al.l. 1201 ih . From a theoretical perspective, they have been 


successfully used to obtain important insight in the behaviour of epidemics 
in idealised populations. Most analytical results, however, have either been 


infection and recovery rates (e.g 

. Diekmann et ah. 

1998; Fames and Keeline. 

2OO2I: V0I7J. 20081 

Ball and Neal. 

2008 

MilleT^. 2011: 

Decreusefond et ah. 2012: 

Barbour and Reinertl. 20131: Janson et ah. 2013h 01 

involve quantities that do 


not depend on the temporal de t ails of the disease dynamics (e.g . iNewmanl . 
2002; Kenah and Robii3 . 2007 : Ball and Neal . 20081 Ball et ah 


20091). In 


this paper we consider general non-Markovian SIR epidemic models and focus 
our attention on arguably the most important piece of information concern¬ 
ing the system’s temporal dynamics: the Malthusian parameter, or real-time 
growth rate. This quantity corresponds to the rate of exponential growth in 
the number of infectives observed in many models when an epidemic takes 
off in a large population and the susceptible population is still large enough 
that its reduction can be ignored. 

Realistic patterns of contact between people typically involve repeated 
interactions with the same i ndividuah and signi hcant heterogeneity in the 
number of contacts reported f Panon et ah . 2012h . From an analytical point 
of view, such a population structure is associated with three problems that 
need to be addressed before the epidemic dynamics can be fully understood. 
These problems are called repeated contacts, infection interval contraction, 
and generational ordering. We begin by illustrating these problems on a 
model scenario. 

Consider an infection spreading between individuals who are able to trans¬ 
mit when they enter in contact with each other. Throughout the paper we 
use the term contact to mean an infectious contact, i.e. a contact that is ‘suf¬ 
ficiently intense’ to result in an infection whenever the individual that makes 
the contact is infectious and the other is susceptible. A precise mathematical 
description of the model is given in Section |2l but in short we assume that, 
upon infection, individuals are attributed a (potentially time-varying) rate 
at which they make infectious contacts. When normalised, this gives the in¬ 
fectious contact interval distribution, i.e. the distribution of times between an 
individual becoming infe cted and the sub s equent infectious contacts made by 


that individual (see also ISvenssonl . 120071: IScalia Tomba et al.l. l2010l: iKenahl . 


201lh . 


When an infectious contact occurs, the infective chooses an individual to 
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Figure 1: Schematic representation of three populations, Pi,P 2 and P 3 . In Pi mixing is 
homogeneous and hence all individuals are connected to each other. In P 2 and P 3 , i, j 
and k all have a small number of neighbours they can potentially infect, although in P 2 
the network is tree-like, while in P 3 the network presents a triangle between i,j and k. 


be the ‘destination’ of the contact among a set of potential ones. Consider 
now a population Pi where homogeneous mixing is assumed, so that any pair 
of individuals can make infectious contacts, and compare it with populations 
P 2 and P 3 where each individual can only make infectious contacts with 
individuals in a hxed set of neighbours (see Figure [I]). In all cases we suppose 
that we are working with a large population of size iV 3> 1, meaning that we 
will ignore effects that are, for example O {N~^). We represent populations 
P 2 and P 3 with a static (undirected) network where edges connect pairs of 
neighbouring individuals. In Section |2] we regard the rate of making infectious 
contacts across each specihed edge as our basic model ingredient. Note that 
population Pi also admits a network representation, namely the complete 
network where each node is connected to all others; however, due to our 
assumption of a large population, the rate of making infectious contacts 
across each specihc edge vanishes and the neighbouring relation becomes 
somewhat ephemeral. 

We now focus our attention on a group of 3 individuals, labelled i, j and 
k, where we assume that i is infectious and the other two are susceptible, 
and we contrast the epidemic dynamics in Pi with those in P 2 and P 3 (see 
Figure [I]). 

First of all, whilst in Pi individual i can potentially infect everybody in 
the population, in P 2 and P 3 , no matter how infectious i is, they cannot infect 
more than four other individuals, thus ‘wasting’ part of their infectivity on 
repeated contacts with the same individuals. 
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Secondly, in Pi each infectious contact is made with an individual chosen 
at random and, because the population is large, it is unlikely for i to contact 
the same individual more than once, so the average time at which trans¬ 
mission (to j, say) occurs is equal to the average time at which i makes an 
infectious contact with j. In P 2 and P 3 , where i may try to infect j multiple 
times, the time of transmission corresponds to the time of the first infectious 
contacts, which will occur on average earlier than the mean time at which 
a randomly selected infectious contact from i to j occurs. We call this the 
problem of infection interval contraction. In choosing this terminology we 
have tried to avoid the ver y similar term ‘gen eration interval contraction’, 
which is already adopted in iKenah et al.l (l2008l) to denote a slightly different 
phenomenon. Their approach based on survival analysis takes into account 
automatically what we here call the infection interval contraction. What 
they focus on instead is the fact that the time at which a susceptible is in¬ 
fected is the minimum of the times of all hrst infectious contacts from all 
potential infectors who are competing for infection of the same susceptible. 
The assumptions we make below avoid this problem as they imply that there 
is never more than one potential infector. In other words, the difference be¬ 
tween infection interval contraction and generation interval co ntraction can 
be exp lained by the two different comparisons being made. In iKenah et ah 


( 120081 ) ■ different levels of infection prevalence are compared within a single 
epidemic model; on the other hand, here we are comparing a network-based 
epidemic (where repeated contacts occur) to a mass-action model (where each 
infectious contact leads to an infection). In particular, because we are inter¬ 
ested in characterising the real-time growth rate, we assume we are in the 
early epidemic phase when the number of cases grows exponentially, i.e. in 
our context the prevalence is assumed to be small. 

Thirdly, once i has infected j in P 2 and P 3 , then j cannot infect i and 
‘loses’ a neighbour due to generational ordering. Considering the next gen¬ 
eration of infection, if j infects k (an individual with only one contact) then 
k actually plays no further role in transmission events due to this effect. In 
Fi, however, j’s infector is a negligible fraction of its total neighbours and so 
this effect is insignihcant. 

What differentiates F 3 from P 2 is the presence of a triangle between i,j 
and k. In the presence of short loops in the network (as it is the case, for 
example, if a small number of individuals all live in the same household), in 
addition to the previous three, two further effects become important. 

Fourth, if i infects j in Fi, the probability that j infects k before i does is 


4 







negligible. Therefore the event that i infects k is not affected by the epidemic 
in the neighbourhood of i or what i did before infecting j. The same occurs 
in P2, because of the lack of a transitive link between i and k. In P3, instead, 
j can infect k before i has the chance to, and so the number of susceptibles 
around i can decrease because of infectious events not caused directly by i. 
We refer to this as the effect of local saturation of susceptibles. 

Fifth, if i infects j, j infects k and i subsequently tries to infect k (an 
event which does not result in an infection as k is not susceptible any longer), 
we have two different ways of attributing cases to different generations of 
infection. Of course, assuming that i is in generation 0, the natural choice 
would be to maintain the real-time description of who infects whom and place 
j in generation 1 and k in generation 2. However, this approach is analytically 
difficult to handle. The alternative is to consider all those that i would have 
infected (both j and k) and place them both in gen eration 1 , in which case the 


trans mission event from j to k is then overlooked flLudwid. Il975l : iPellis et al 


2008h . We term this phenom enon overlapping generations. In IPellis et ah 


(120121) and iBall et al.l (120141 ) the impact of this problem in dehning and 


computing the basic reproduction number Rq in models that involve small 
mixing groups (e.g. households or workplaces) is carefully examined. 

In generic social structures, these five effects often occur simultaneously. 
However, in this paper we consider scenarios in which they can be discrimi¬ 
nated and progress can be made analytically, namely we focus on population 
with the structure of P2, where the problems of local saturation of suscepti¬ 
bles and of overlapping generations described above need not be considered. 
For this reason, we make the strong assumption that that the proportion of 
possible transitive links is negligible. This means that the network is locally 
tree-like. While some progress is possible for real-time growth rate calcula¬ 
tions in the presence of many short loops in the network (like in P3), this 
often inv olves either restriction to compartmental dynami cs or approximate 
methods ( Fraserl. 2007 : Ross et ah . 2010l: Pellis et ah . 2010l) . Analysis of gen¬ 
eral stochastic dynamics of the kind we present here in the presence of short 
loops woul d require a majo r conceptual advance not currently available, al¬ 
though see lBall et al.l . I20l4 for a deeper exploration of these problems in the 
context of households models. 

In addition to our hrst assumption of a locally tree-like network, we make 
two further assumptions. The second assumption is that the individuals are 
sufficiently ‘close’ to each other that the epidemic does indeed grow expo¬ 
nentially during its early phase rather than more slowly (e.g. quadratically. 
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as would be expected on a two-dimensional lattice). More formally, this 
assumption requires that mean path lengths on the giant component are 
O(lnA^). Thirdly, we assume that the distribution of numbers of contacts 
is not too heterogeneous (technically, that the second moment of the degree 
distribution does not diverge) so that the Malthusian parameter does not 
diverge as the system size becomes very large. 

The three assumptions detailed above are fundamental to our approach, 
in the sense that it is not clear how to analyse real-time behaviour of an 
epidemic on a network mathematically if they do not hold. We also make 
other assumptions that signihcantly simplify the analytical results obtained, 
but which can be relaxed with a certain amount of elementary but poten¬ 
tially tedious algebra. In particular, throughout we assume individuals are 
all identical to each other, in the sense that there is no variation in suscep¬ 
tibility and in the network model they only differ in terms of their degree. 
However, a key feature of the present work is a careful treatment of an indi¬ 
vidual’s infectiousness, which is allowed to vary over time, according to some 
stochastic infectivity prohle. These infectivity prohles need not be the same 
for every individual, but we assume that they are independent realisations 
from a specihed distribution that is the same for every individual. We also 
assume that there are no degree-degree correlations, meaning that (given 
our other technical assumptions) the conhguration model can be used for the 
contact structure. 


In setting up our framework, we took inspiration from iDiekmann et ah 


fll998l) . where the authors focus on the computation of Rq in the presence of 
repeated contacts under the same unclustered network approach discussed 
above. However, we here extend their work by adding the computation of 
the real-time growth rate and by moving from their deterministic framework 
to a stochastic one. Any results involving the real-time growth rate has its 
foundations on the so called Lotka-Euler equation, originated in t he he ld of 
demography and discussed, for example, in Wallinga and Lipsitch ( 20071) and 
references therein. In the simple case of a epidemic spreading in a large and 
homogeneo usly mixing popu l ation , the Lotka-Euler equation is derived as 
follows (see IDiekmann et ahl . l2013l p. 10 and 212). Assume the population 


has size N and each infective, t units after having been infected, makes 
infectious contacts with other individuals at a rate B{t). Then the expected 


6 














incidence i(r) at absolute time r satisfies the renewal equation 


ii.T) = J B{t)i{T - t) dt, (1) 

0 


where S{t) is the number of susceptibles at time r. Early on, when almost the 
entire population is susceptible the incidence grows exponentially, S{t)/N ^ 
1 and B{t) in this linearised process describes the rate at which new cases are 
generated by a single infective at time t after their infection. Substituting 
the Ansatz z(r) = into ([T]), we deduce that the real-time growth rate r 
must satisfy the Lotka-Euler equation: 

OO 

j 5(f)e-"* dt = 1. (2) 

0 


Since the left-hand side of Equation ([2]) is continuous in r and has limits of 
0 and OO (assuming B is not zero almost everywhere), a solution must exist 
by the intermediate value theorem. It is also strictly monotonic in r, and so 
any solution must also be unique. 

While we believe the question of real-time behaviour of epidemics on 
networks is of inherent theoretical interest, the main aim of this work is three¬ 
fold: (i) to make clear the key differences between the epidemic dynamics in 
homogeneous mixing populations and on locally unstructured networks; (ii) 
to provide a comprehensive list of results in analytically tractable cases, that 
can be used by modellers to validate the outputs of complex simulations, for 
example individual-based ones; (hi) to provide the tools to assess the accuracy 
of approximations for the real time growth rate that involve either ignoring 
the network structure altogether or neglecting part of t he coniplexiti es to 
achieve sign i ficant ly simpler analytical results, as done in iFraserl (120071) and 


Pellis et al.l (120101) . 


The rest of the paper is structured as follows. In Section |2] we present 
our modelling approach, spelling out the assumptions behind the network 
construction and the epidemic spreading on it, and we define the fundamental 
quantities of interest in the network model and their counterparts in the 
limiting homogeneously mixing scenario. We then consider a list special cases 
in Section |3l for which we provide extensive analytical results. In Sections 
m and 0 we discuss more general analytical considerations and fully general 
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Monte-Carlo methods for numerically computing the real-time growth rate. 
Finally, extensive numerical illustrations are presented in Section E] and hnal 
considerations in Section [3 


2. Model definitions 

2.1. Configuration network model 


We consider a static network representing a population of individuals 
and their interactions. We assume the population size N is large relative to 
the number of infected individuals, which is in turn signihcantly larger than 
unity, as we are interested in studying the asymptotic early spread of the 
infection. 

To ensure that the network is locally unclustered, we ass ume the network 
is con structed according to the so-called configuration model flMollov and Reedl. 


1995 ) with the conventi ons most commonly used in epidemiology ( Ball and Neal 


20081: iBall et al.l. l2010l) . In this model, each individual i [i = 1, 2,..., N) 


is given a degree di which is an independent realisation of a non-negative 
integer-valued random variable D. The distribution of D is called the degree 
distribution. Individual i is then allocated di stubs (half-edges). If the total 
number of stubs in the population is odd, the process of attributing a degree 
to each individual is repeated. If the number of stubs is even, stubs are 
selected at random without replacement and paired. The resulting network 
might have some short loops and self-edges, but if a sequence of such net¬ 
works is constructed and indexed with the population size N and we assume 
that D has hnite mean and variance, short loops and self-ed ges appear wit h 
probability of order 0{N~^) and can be ignored as N ^ oo flDurrettl. 120071) . 


2.2. The infectivity profile 

The function that describes the way an individual’s infectiousness changes 
through time is called their infectivity profile. We make the simplifying as¬ 
sumptions that individuals do not differ in terms of biological susceptibility 
and that the infectivity prohles of different individuals are independent and 
identically distributed according to some known distribution. In addition, we 
assume that individuals can only transmit the infection across edges, that 
each individual’s infectious behaviour applies equally across all links with 
neighbours (in the precise way specihed below) and that the infectious state 
of the individuals does not modify or affect the network structure. 




















When infected, individual i is attributed a realisation Xi{t) from the in- 
fectivity prohle distribution, where t > 0 is the time since the infection of 
i. For each neighbour in the network, i makes infectious contacts at the 
points of an inhomogeneous Poisson process with rate Xi{t). Thus, given 
Xi{t) for individual i, the probability that i makes infectious contact with a 
given neighbour in the interval (t,t +At) is Xi(t)At + o{At). If the individual 
contacted by i is susceptible, then they become infected. We assume that 
the Poisson processes governing the infectious contacts between every pair 
of individuals are conditionally independent given the infectivity prohle of 
individual i. 

We denote by A{t) {t > 0) the random infectivity prohle, which is the 
same for every individual and of which the Aj(t)s are independent realisations 
and we assume that E[/q°° A(t)dt] < oo and E[A(t)]dt < oo and so by 
Fubini’s Theorem these two integrals must be equal. 

2.3. The basic reproduction number Ro 

We begin by calculating Rq, dehned to be the expected number of sec¬ 
ondary infections caused by a typical infective in the early stages of the epi¬ 
demic. Let the random variable A = A(t)dt represent the total infectivity 
spread by an arbitrary infective across each of the links they have. Condi¬ 
tional on degree D = d and total infectivity A = a, the number of secondary 
cases that an infective will cause during the early stages of the epidemic is 
distributed Bin((i— 1,1 — e““). The number of trials d — 1 takes into account 
the fact that the infective must have acquired the infection from one of its 
neighbours, who is therefore no longer susceptible (the generational ordering 
effect mentioned in the introduction). All other neighbours are almost surely 
susceptible because in the early stages almost all of the population is suscep¬ 
tible and short loops in the network appear with negligible probability. The 
probability of transmission occurring across an edge between the infective 
and a susceptible neighbour (1 — e““) takes into account that at least one 
infectious contact across a link is necessary, but all subsequent contacts are 
ineffective. Note that the infection of each of the susceptible neighbours are 
not independent events, but are conditionally independent given A = a. 

A typical infective during the early stages of the epidemic has more con¬ 
nections than the average individual in the network. The probability that 
transmission across an edge reaches an individual of degree d is equal to 
d X P(Zi) = d)/'E[D], which we call the degree-biased distribution. Conditional 
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on having total infectivity a, this individual of degree d then infects on av¬ 
erage {d — 1) (1 — e““) new cases. Since an individual’s degree D and their 
infectivity A are independent, removing the conditioning gives 


Rq — 


E[D{D - 1)] 


E [1 



( 3 ) 


Hereafter we denote the mean degree of the node reached by a randomly 
selected edge by 


d* 


E[D{D - 1)] 

e[d^ 


( 4 ) 


2.4- The real-time growth rate 

In this paper, our primary interest is studying the real-time growth rate 
- the rate of the exponential growth in the number of infectives during the 
early phase of an emerging epidemic - which is denoted by r. Unlike Rq, the 
real-time growth rate depends on the full infectivity prohle A{t) and not just 
its integral A. Of all the infectious contacts towards a susceptible neighbour, 
only the hrst contact results in an infection. The rate at which infections 
occur is therefore equal to the rate at which contacts are made multiplied by 
the probability that no contacts have occurred previously. Let Ai(t) denote 
the rate at which first contacts occur, and hence 

Ai(t) = A(t)e-4o^Wdy (5) 


Note that /Q°°Ai(t)df = 1 — e~^. We can also describe the distribution 
of times since infection at which infectious contacts are made, which has 
probability density function (pdf) w(t) = E[A(t)] /E[A\ and the distribution 
of times since infection at which infections are made, which has pdf wfit) = 
E[Ai(t)] /E[1 — e“"^]. We refer to the former as the infectious contact interval 
distribution and the latter as the generation interval distribution. Let the 
random variables W and Wi denote draws from these two distributions, 
which are both implicitly conditional on the fact that at least one infectious 
contact is made across the edge. The mean generation interval, often referred 
to in the literature as generation time, of the epidemic on the network is 
therefore 

T, = E|H',]. (6) 

Using again the degree-biased distribution to compute the degree of a 
randomly selected new infective, the average infection intensity of a typical 
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infective is therefore 


/3(f) = d*E[Ai(t)]. (7) 

As for the derivation of Equation ([2]), given the rate f3{t) at which a 
typical infective generates new infections t units of time after their infection, 
the expected incidence i(r) at absolute time r satishes the renewal equation 

OO 

i(r) = J — t) dt. (8) 

0 

Substituting the Ansatz i(r) = e'”^, the real-time growth rate r must satisfy 


/5(t)e dt = 1, 


(9) 


which has the same form of Equation ([2]) because all the complexities due 
to the network structure are absorbed in (3(t). Like for Equation ([2]), the 
solution exists (provided /3 is not zero almost everywhere) and is unique. To 
our knowledge, the most comprehensive derivation of this equation from an 


under lying stochastic process on a network is given by iBarbonr and Reinert 


(120131) . and in a deterministic framework bv iDiekmannl (120141) . Note also 


that /3(t) dt = Rqj so Equation (|9]) can be also formulated as 


wi(t)e dt = ^, 
tio 


( 10 ) 


from which it is clear that, for i?o < 1) -Ro = 1 and Rq > 1, the real-time 
growth rate r is strictly negative, zero, and strictly positive, respectively. 

One does not observe the real-time growth rate in practice unless i?o > 1 
and the epidemic takes off. However, if i?o < the rate r of exponential 
decay can still be observed in theory if it is assumed that the number of 
initial infectives is large enough to avoid stochastic fluctuations, but still 
small relative to the total population. 


3.5. Homogeneously mixing model 

To understand the effects that the conhguration network structure has 
on the dynamics of the infection, it is useful to compare the conhguration 
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network model to one in which individuals mix homogeneously. In this model 
an infected individual makes contacts with every individual in the population 
with equal probability and so this model can be thought of as an epidemic on 
a complete graph. Clearly the complete graph does not have the locally tree¬ 
like structure of the conhguration network; however, since each individual in 
a population of size N has N — 1 neighbours, the probability of an individual 
attempting to contact the individual that infected them is 0{N~^) and can 
be neglected when N is large. Hence in its early stages, the epidemics again 
has a branching structure and all hve problems spel t in the introduction 
disapp ear as N tends to inhnity. For more details see IB arbour and Reinert 

feniali . 

More formally an infected individual is allocated an infectivity prohle 
A^{t) and makes infectious contacts to each of their N — 1 neighbours at 
the points of an inhomogeneous Poisson process with rate A^(f), where t is 
the time since they were infected and the superscript h refers to the homo¬ 
geneously mixing population. The expected total rate at which an infective 
makes infectious contacts is therefore = (A^ —l)E[A^(f)]. As we let the 
population size N tend to inhnity, we reduce A^(f) such that /9^(t) remains 
hxed. 

In the limit of an inhnitely large homogeneously mixing population, the 
probability that an infective contacts the same individual more than once is 
zero. Therefore in the early stages of the epidemic, when the population is 
almost entirely susceptible, every infectious contact results in an infection. 
The expected rate at which infections are made is therefore - the same 
as the rate at which infectious contacts are made. This is the key difference 
between the homogeneously mixing model and the network model, as in the 
network model infections occur proportionally to the rate of the expected 
first contact. 

Note that the basic reproduction number for the homogeneously mixing 
model is given by Rq = (3^ (t)dt. Similarly, let w^it) denote the genera¬ 

tion interval distribution, which represents the distribution of times relative 
to one’s infection at which infections are made in a large homogeneously mix¬ 
ing population. Because every contact results in an infection, in a large and 
homogeneously mixing population the contact interval distribution and the 
generation interval distribution are the same. Analogously to the conhgura¬ 
tion network model Wi{t) = f3^{t)/RQ. Note that Wi can also be interpreted 
as the pdf of a random variable, Wi say, representing the time of an infection 
from a typical infective, conditional on at least one occurring. Hence, the 
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generation time = E represents the mean time interval between the 
infection of a case and one randomly selected secondary infe ction, provided 
that at least one occnrred. Alternatively (see SvenssorJ . 2007 1 Tg can eqniv- 
alently be dehned as the time interval between the infection of a case and the 
infection of that case’s infector. Finally let the real-time growth rate for the 
homogeneonsly mixing model be denoted by r^, which is the uniqne solntion 
to the Lotka Enler eqnation 


OO 

j df = 1. (11) 

0 


2.6. Approximation to the real-time growth rate 

For some infectivity profiles the fnnction /3{t) = RoWi(t) may be difficnlt 
to compnte, preventing the calcnlation of r for the configuration network 
model. A possible approximation /^app of l3 involves replacing the generation 
interval distribution wi{t) with the infectious contact interval distribution 
w{t). Hence, 

/^app(f) = Row{t). (12) 


This approximation treats correctly the overall infectivity and the prob¬ 
ability of infections occurring, but approximates the times at which the real 
infections are made with the times at which infectious contacts are made. 
The result is an approximation of the real-time growth rate r on a network, 
given by the unique value rp,pp that sat isfies /3app(t)e“'’'‘pp*df = 1. Some 
authors (IFraserl . 120071: iPellis et al.l . l2010l) have considered a similar sort of ap¬ 
proximation in the slightly more complex case of a households model, where 
the additional problems of local saturati on of susceptible s and overlapping 
generations cannot be neglected (see also iBall et al.l . I20l4 for a careful anal¬ 
ysis of this sort of approximation). 


2.1. Comparisons between models 

In order to make meaningful comparisons between the configuration net¬ 
work and the homogeneously mixing models it is necessary to match certain 
quantities. In particular, we hrst need a matching condition involving the to¬ 
tal infectivity (integrated over time) to establish the probabilities that events 
happen (irrespective of when). Second, we need a way to match the timescale 
dimension of the infectivity profile in the different models, to guarantee a 
meaningful comparison of the temporal dynamics. 
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The expected total infection pressure exerted by an individual in the 
network model is (i*E[74], whilst in the homogeneous mixing model it is R^. 
We take as our hrst matching condition that the overall infection pressure 
from each infective is the same in each model, i.e. (i*E[74] = Rq. Another 
alternative matching criteria could have been assuming that Rq = Rq. The 
reason why we chose the former setting is that Rq = d*E [l — e“"^] already 
takes into account some of the aspects of repeated contacts due to the network 
structure (through the probability of at least one infectious contact across an 
edge 1 —e“"^) which is among the effects we wish to investigate. Furthermore, 
note that if A is random, it is arguably more intuitive to keep hxed the mean 
of A than the mean of the ‘distorted’ random variable 1 — However, 
note that if we increase the connectedness of the network model in such 
a way that Rq is kept constant then we must have that E[A] —)■ 0, which 
implies that E [l — —)■ 0 and therefore these two possible comparisons 

are asymptotically equal. 

The matching on the temporal dimension between the homogeneously 
mixing model and the network model is achieved by assuming that A^{t) cx 
A(t) for all f, almost surely. 

Within either model, care also needs to be taken when comparing the 
impact of various infectivity prohles. Depending on the context, we might 
keep hxed R^ or Rq (i.e. E[A] or E[l — e“"^]) when varying the infectivity 
prohle. Finally, in terms of temporal scale, the choice of what to keep hxed 
in general depends on the details of of the infectivity prohle formulation, so 
we specify clearly how the comparison is made in each numerical example. 


3. Special cases 


We now consider some specihc infectivity prohle distributions where more 
explicit calculations of the real time growth rate can be made. So me of these 


speci al cases have already been considered in other studies (see e.g. lWallinga and Lipsitchl . 

20071 1. although predominantly in large homogeneously mixing populations 

and not in such generality. It is convenient to categorise these examples into 

a few broad classes, as shown in Figure |2l In the hrst class, referred to as 

the time-varying-infectivity class, the infectivity prohle A(t) is non-random 

and therefore is the same for all individuals. The second class is referred to 

as the standard stochastic SIR model class, and assumes constant infectivity 

during an infectious period with a random but specihed length. We then 

consider compartmental models where the infectious period is obtained from 
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Time-Varying Infectivity (TVI) Standard stochastic SIR 




0 1 2 3 0 1 2 3 

Time, t Time, t 

Figure 2: Illustration of the four different model classes considered; a darker shade denotes 
higher probability. 


a phase-type distribution and infectivity can depend on the compartment 
and we conclude with general time varying-infectivity models. 


3.1. Non-random time-varying-infeetivity models 
In the time-varying-infeetivity (TVI) class 


see 


Pellis et ahl. l2010l) . the 


infectivity prohle A(f) is non-random and is therefore the same for all infected 


individuals. Hence the total infectivity across a link A is constant, as well 
as the probability of transmission across a link, 1 — e~^. Therefore, Rq = 
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3.1.1. Gamma-shaped TVI 

Assume that the infectivity prohle is proportional to the pdf of a F- 
distributed random variable with shape parameter a and scale parameter 
7. Let frit) = t > 0, denote the pdf of a Gamma ran¬ 

dom variable with these parameters and let Fr(^) denote the correspond¬ 
ing cumulative density function (cdf). Thus, A(f) = Afr{t) in the network 
model and therefore the matched homogeneously mixing model (see 12.7p has 
/J^t) = flS/r(t). 

In the homogeneously mixing model there is an explicit formula for the 
real-time growth rate. Here, w’^{t) = fr{t) and Tg = a/7. The Lotka-Euler 
equation (ITTD becomes 

00 

Hi j At ^1, 

0 

where the left-hand side is proportional to the moment generating function 
(mgf) of the Gamma distribution evaluated at r^, i.e. j • Hence, = 

i(W)"-i)- 

On the network, we have 

Ai(f) = 

and wi{t) = Ai(f)/(1 — e“"^). The Laplace transform of wi{t) appears to 
be intractable, so quadrature methods are then needed to compute Tg and r 
numericalljfl. 


3.1.2. Reed-Frost model 

In the Reed-Frost model (see lAndersson and Britt^ . 2000), each infected 
individual experiences a latent period of the same hxed duration T after 
which they spread all their infectivity instantaneously. The Reed-Frost model 
forms the limit when the variance in the times of infectious contacts tends 
to zero. 


^In terms numerical stability, when calculating the Laplace transform of Ai(t) for 
small values of A (i.e. when r becomes negative and hence e“''* explodes for large t), we 
recommend using the fact that Ai(t)e“^* = where g is the pdf of a 

Gamma distribution with parameters a and z + 7 . 
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There is only one point in time at which and an infectious contact and 
therefore an infection can occur, and so = Tg = T, and hence w^it) = 
wi{t) = Srit), where 6rit) is the Dirac’s delta function centred at r. Direct 
solution of the Lotka-Euler equation leads to = In {Rq) /Tg and r = 


3.1.3. Heaviside TVI 

In this case the infectivity prohle is some constant A between times u and 
V and zero elsewhere. Hence, A(f) = for the network model and 

= <!{.<*<.} for the homogeneously mixing model, where Q < u < v 
and 1 denotes the indicator function. 

For the homogeneously mixing model, w\{t) = l{u<t<v}/{v — u), Tg = 
{u + v)/2 and satishes the transcendental equation 

- e""’’) = r'^. 

On the network, Ai(f) = and hence 


T 

-t n 


1 u — ve 
A 1 - e-^h-D 


The real-time growth rate r satishes the transcendental equation 

d*-^ (e-™ _ ^-rv-Hv-u)^ ^ 

X+r^ ’ ^ ' 

The parameter u represents the length of the latent period. When u = 0 
this model becomes the SIR model with constant infectious period, which is 
treated in Section [3.2.31 


3.2. Stochastic SIR model 

In the stochastic SIR model (hereafter denoted as sSIR), each infected 
individual remains infectious for a period of duration J, where / is a non¬ 
negative random variable having arbitrary but specihed distribution. During 
the infectious period, infectious contacts towards each neighbour occur at 
the points of a homogeneous Poisson process with rate A. Therefore, A{t) = 
Al{o<t</}, A = XI and E[A(f)] = A (1 — Fi{t)), where Fj denotes the cdf of 
I. 
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For the homogeneously mixing model /3"(t) = and Wi{t) = 

(1 — F/(t)) /E[/]. Integrating by parts, we obtain 


E[/^] 

^ 2E[J] ■ 


(14) 


Similarly, the Lotka-Euler equation (ITT|) becomes 


. 1 - 

° r^E[J] 


1 , 


(15) 


where Mj[z) = E[e“^^] is the moment generating function of I. 

For the network model we calculate Tg and hence r as follows. Recall 
that the random variable Wi represents the time at which infection crosses 
an edge, given that this infection does occur, and that IFi has pdf wi{t). 
Let U ~ Exp(A) be the time to the hrst event in a homogeneous Poisson 
process with rate A, which represents a viable infectious contact if and only 
if it occurs before the recovery time of the infective. Hence, Wi = U\{U < I). 

From the dehnition of conditional expectation. 


M^,(z) =E[e-^’^i] 

“ < I) 


Note also that 


E[e — E 


Ae 


—Xs^—zs 


ds 


= E 


X z 


(l - 


A 


X z 


{l-Mi{X + z)). 


Letting z —>■ 0 in the calculations above we also obtain F {U < I) = 
1 — Mi{X), and hence. 


Mwi {z) 


X f ^ ~ Mj[X + 2;) A 

A + ^ V 1-M,(A) J ■ 


(16) 
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Equation fll6p has been previously derived by Prof. F. Ball and independently 
by Prof. O. Diekmann (personal communications). As a note of caution, 
Mwi{z) as defined in flT6l) is undefined for 2; = —A, but can be extended by 
continuity to give Mvki(—A) = AE[/] / (1 — M/(A)). 

The generation time Tg = E[fPi] can now be obtained via 


T 

-*■ n 


1 

A 


dMr{z) 


dz 

z=X 


l-Mj(A)' 


( 17 ) 


Note that the second term in flTTH is negative, and so Tg < 1 /A. The real¬ 
time growth rate r is the unique solution of the Lotka-Euler equation, which 
in this notation becomes RoMwi{r) = 1. 


3 . 2 . 1 . Markovian SIR model 

The notable special case of the sSIR model for which I is exponentially 
distributed is usually referred to as the Markovian SIR model and is typically 
the most analytically tractable. Let I ~ Exp(7), so that 7 represents the 
recovery rate. Then E[/] = 1/7 and Mi{z) = 7/(7 -f z). 

For the homogeneously mixing model, Wi{t) = ye”'*'*, so that the random 
variable Wi, representing the time to an infection given that one occurs, is 
Exp(7). Hence = I/7, and 


r 


h 


On the network, we have 


'rh 

g 


( 18 ) 


A,(i) 


Ae 0 < t < / 

0 otherwise 


and therefore E[Ai(f)] = The probability of infection across an 

edge is E [1 — e“^] = A/(A -|- 7), which can also be derived by a competing 
hazards argument, andtci(t) = (A-|-7)e“A+7)7 Therefore, Wi ~ Exp(A-|-7), 
whence Tg = 1 /(A -I-7) and l 3 {t) = Rq{X + 7) The real-time growth 

rate r is therefore 


r = 


To ~ 1 

T 

-‘■g 


( 19 ) 


19 







Note the strong similarity between Equations fll 8 l) and fll9l) . If the homo¬ 
geneously mixing model is matched to the network model as described in 
Section [221 (which in this context means keeping both A and 7 constant), 
then we hnd = Xd* — 7 , whilst r = X{d* — 1) — 7 . The term {d* — 1) in r 
instead of d* in accounts exactly for the amount of infectivity wasted in 
repeated contacts towards already infected neighbours. 


3.2.2. Gamma-distributed infectious period 

Assume that I follows a gamma distribution with shape parameter a and 
scale parameter 7 , i.e. I ~ r(a, 7 ). In this case E[/] = a /7 and E[/^] = 
a (1 - 1 - a) / 7 ^ and hence from fITT)) 


rph _ 

-^9 ~ 


\ Oi 

~W' 


( 20 ) 


(see also ISvenssonl . 120071 ). The Lotka-Euler equation ffTHl) can be solved 
numerically to hnd r^. 

For the network model, we obtain from flTTI) that 


_ 1 _ 7+a (7+a) 

J- n — , 


A 


!-(*) 


( 21 ) 


and the real-time growth rate r can be found numerically as the solution of 
RoMwi{r) = 1 using (|T6|) . 


3.2.3. Constant infectious period 

The constant infectious period, in which I = l almost surely, is the lim¬ 
iting case of a Gamma distributed infectious period where a —)■ cx) while 
E[J] = a /7 is kept constant. This also corresponds to a special case of the 
the non-random Heaviside infectivity prohle discussed in Section 13.1.31 when 
u = 0 and v = l. In this case, from fITT)) . | and, from flT5|) . satishes 

1 — 

= 1 - ( 22 ) 

On the network, from fITT)) . 


T 

■J- n 


1 Le 

A 1 — e“^'- 


( 23 ) 
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(24) 


and, from ffTB]) . r is the unique solution of 

\ 1 p—(-^+2)4 

-^^ = 1 . 


A + z 1 


— \l 


3.3. Compartmental models 

We now consider the case where the disease progression in an individual 
is modelled by progression through membership of disjoint compartments, 
connected in general to form any combination of paths in series and parallel, 
wit h different infecti vities in each of them. Such a Markov reward process (see 
e.g. lAsmussenl. 120031) . also sometimes referred to as ‘line ar chain trickery’ (see 


Breda et ah . 20121. p. 105 and 106, or Diekmann, 2014), allows for a flexible 


modelling approach in which quantities of interest can be calculated using 
relatively simple formulae. 

The starting point for the compartmental approach is that there is a hnite 
number of disease states, labelled by integer indices a,b,... between 1 and m, 
each with its own rate of making infectious contact with other individuals. 
Let an individual have a random variable X{t) for its disease state at time 
t, which takes the value S if the individual is susceptible, R if the individual 
is recovered, or R if the individual is in the a-th disease state. We suppose 
that the probability of a newly infected individual starting in state is Ua, 
that the rate of going from R to R is aa,b, and that the rate of going from 
R to R is pia. This general framework is shown in Figure [3l Note that 
infected individuals can potentially start or hnish their infectious life from 
any state R (for example, newly infected cases can develop either severe 
or mild symptoms with some hxed probabilities, and recover at different 
rates; or distinction between compartments can be only hctitious, with the 
aim of achieving sojourn times with distributions other than exponential - 
see Section 13.3.51) and that infectious states may be visited any number of 
times. We assume that all rates are non-negative and we exclude pathological 
cases, by assuming hnite infectivity in each state, no infectivity in state R 
and that individuals cannot be infectious forever, i.e. infectives recover with 
probability 1. For notational convenience, let z/ = (z/i, 1 / 2 ,..., Um) be a row 
vector, denote vector transposition and cxa := J2b^a,,b denote the total rate 
at which an individual in infectious state a moves to other infectious states. 


3.3.1. Network mixing 

Let Xa be the rate at which an infective makes infectious contacts across 
an edge while in state a. From Sections 12.31 and 12.41 we know that, in or- 
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Figure 3: Illustration of a phase-type distribution. 


der to calculate all quantities of interest, we need to describe the average 
infection intensity From Equation ([7]), (3{t) = (i*E[Ai(f)]. This simple 
factorisation, which depends on our assumption that the network structure 
is unaffected by the infection process, implies that it is sufficient to study the 
transmission process across a single edge, as described by Ai(t). To charac¬ 
terise Ai(t) we consider the continuous-time Markov chain defined by events 
and rates 

la h at rate aa,b , 

la ^ R at rate /Xq , (25) 

la ^ J at rate Aq , 

where state J represents the situation where an infectious contact has been 
made and the edge cannot transmit the infection anymore. We decompose 
the state space S of this Markov chain in 5 = MUC, where C = {Ia}a=i,...,m 
are the transient states of the system and A = {R, J} are the absorbing 
states. Consider now the generator matrix restricted to the transient states 
only, which we call Q = {qa,b), where qa,b = CFa,b for a 7 ^ 6 and qa,a = 
— (cTa -b Aa -b fia) («, & G {1, 2 ,..., m}).Note that the rows of Q do not sum 
to 0 because of the transitions into absorbing states. The probability of an 
infective being in infectious state b at time t after their infection if they 
started from state a can be computed using matrix exponentials as 

P[A'(«) = 4|A'(0) = /J =(e'«)_„ . (26) 
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The rate at which the infective transmits along the edge when is state la is 
Aa, so 

m 

A-i(^) = ^ Aal{x(i)=74 (27) 

a=l 

and, from ([7]), 


Pit) = d*E[Ai(f)] 

m 

= d*5^A,p[x(f)=4X(0)~z/ 


or, in vector notation. 


a=l 


/?(i) = (iVe'OA^ 


(28) 


(29) 


Note that /9(t) is described in terms of an exponential of a matrix but that, 
to calculate Rq and the real-time growth rate r, we require only the integral 
or the Laplace transform of /9(f). The theory spelt out in iDiekmann et al. 
(120131) . Section 8.2, considers exactly this situation; therefore, we here only 
clarify that our definition of Q suitably applies to the assumptions behind 
such a theory and present the results. 

Recall that the spectral bound of a matrix is the largest of the real parts 
of its eigenvalues. Then, in our case. 


Lemma 1. The spectral bound of Q is strictly negative. 


Proof. Recall that Q is the generator matrix of the continuous-time Markov 
chain {A^(f) : f > 0} restricted to the transient states C. Note hrst that Q 
is weakly diagonally dominant, i.e. \qa,a\ > Wa,b\- Our assumption that 

individuals recover with probability 1, which is equivalent to all states I a 
being transient states, implies that: Q is either irreducible and there is at 
least one state a such that pia > 0; or Q is reducible, but any subset of states 
for which the generator matrix of the Markov chain restricted to that subset 
is irreducible has at least one state a for which /i^ > 0. Consider such a subset 
of states and denote by Q the sub-matrix of Q involving only the states in 
this subset. Then Q is irreducible and is therefore irreducibly diagonally 


dominant and hence invertible (see, e.g. Horn and .Tohnsonl. 1985 . Theorem 


6.2.27). Further, given Q is weakly diagonally dominant and the elements 
on the diagonal are real and negative, by Gershgorin’s circle theorem all its 
eigenvalues are either null or have strictly negative real part. But invertibility 
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implies 0 cannot be an eigenvalne, so they are all strictly negative and hence 
the spectral bonnd of the entire matrix Q is also strictly negative. □ 


Onc e we have verihe d that the spectral bonnd of Q is negative, Lemma 
8.14 of Diekmann et ah ( 20131) guarantees that, given the off-diagonal ele¬ 
ments of Q (i.e. the transition rates) are non-negative, the integral e“®dM 
converges for t —)■ cxd, 

POO 

/ e“®dM = -Q-^ (30) 

Jo 

and —Q~^ is a positive matrix, whose element in position (a, b) represents 
the average time spent in state b given we started from state a. Therefore, 
we have 


Rn — 


(3{t) dt = d*u 


To calculate Tg we hrst use integration by parts to show that 

poo 

/ ^e-*«d^=(Q-l)^ 

Jo 

rj. _ ir 

^ rmdt iy(-Q-^)X^- 


from which 


(31) 


(32) 


(33) 


Finally, to calculate r from fl29|) and ([9]), notice that can be rewritten 

as where Im is the identity matrix of size m. Then, r can be found 

as the unique solution of 


(- {Q - rim) ) A"^ = 1 . 


(34) 


The theory in Diekmann et al.l ( 20131) . Section 8.2, also guarantees that 
has a unique solution and that = 1 r = 0 and Rq > 1 

r > 0, as one would expect. 


3.3.2. Homogeneous mixing 

The theory described above extends to the case of a homogeneously mix¬ 
ing population with with only minor differences. In a large homogeneously 
mixing population, the probability of making an infectious contact with the 
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same individual twice is negligible, so in this case the Markov chain describing 
the transmission process requires only the following events and rates: 


la h at rate aa^ , 
la ^ R at rate Ha ■ 


(36) 


Again we decompose the state space of the Markov chain in U 

where = {/„} are the transient states of the system and A^ = {i?} is the 
only absorbing state. Consider again the generator matrix restricted to 
the transient states only. Unlike the network model, where we worked with 
the rate of making infectious contacts across each single edge, here we specify 
the rate, rja say, at which an individual in infectious state a makes infectious 
contacts in the population as a whole and we let rj = (r/i, 772, • • •, Vm)- The 
overall rate at which infectious contacts are made in the population is 


m 


(5\t) 

= Y,'nor[x\t) = iaX\Q)r^p\ , 

a=l 

(36) 

or, in vector notation. 


(37) 


Given that the homogeneously mixing and the network cases differ only 
in the infectivity terms and the dehnition of the generator of the Markov 
chain, the same theory as before applies, leading to 


and A being the unique solution of 

z/ {Q^ - rim) = l ■ (40) 

Remark 1. One interesting difference between the network and the homo¬ 
geneously mixing cases is that, on the network. Lemma [1] holds even in the 
case of an SI model, where individuals never recover. Mathematically, the 
reason is that, in addition to the recovery rates, the diagonal terms also in¬ 
volve the rates of infecting the neighbour and are therefore non-zero even 


(38) 

(39) 
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if the jj^aS are all zero. This implies that i?o is finite even if individuals are 
infectious forever. The intuition is that, while in the homogeneously mixing 
model individuals that are infectious forever can lead to i?o = oo, on a net¬ 
work where d* is hnite, the number of new infections an infective can make 
is bounded by the number of neighbours. 


3.3.3. Relationship with dynamical systems 

The approach we have taken here is slightly unusual, in the sense that 
many researchers might hnd it more natural to compute r by linearisa¬ 
tion around the disease-free equihbriurn of a set of ODEs describing the 
infection spread. Diekmann et al.l ( 2013 1. Section 7.2, is concerned exactly 
with discussing the equivalence of these two approaches in the computa¬ 
tion of Rq i n a large and homogen eously mixing population. Adapting the 
notation of Diekmann et al.l ( 20131) . we denote by the number of indi¬ 
viduals in infectious state R in the homogeneously mixing population and 
xh _ X 2 ,..., . Then the linear system for the infection dynamics in 

the early phase of the epidemic can be written as 


dt 


X 


= x\T'^ + Q’^) , 


(41) 


where is a matrix containing only those terms related to transmission 
and Q^, as before, describes the transitions between states (including re¬ 
covery rates, which cause some of the rows not to sum to 0). The m x m 
transmission matrix has the special form = u~^ri because an infective in 
state a generates new cases in state b at rate rnyh, and has therefore r ank 1, 


with the only non-zero eigenvalue given by iyT]~^. Diekmann et ah ( 20131 ) show 


, -1 


how Rq can then be calculated as the dominant eigenvalue of —(Q^) 
which in our case leads to (l38|) . The computation of r follows the same ar¬ 
gument once is replaced by (Q^ — rl), as can be noted by comparing 
Equations (|38|1 and (HOjl above. 

The case of the network model is slightly more complicated, because in 
addition to the infectious state of individuals, their infectivity also depends 
on their degree and the number of edges still available for transmission (the 
used ones are ‘burnt out’ and do not allow further transmission). Therefore, 

we need to write a linear system for the vector x = of the numbers 

of individuals in state a with degree d and s susceptible neighbours (a G 
{1, 2,..., m}, d = 1,2,... and s G {0,1,..., d — 1}). Again, the system can 


26 

















be put in the form (assuming for simplicity a finite maximum degree) 


dt 


X = x{T + S) , 


( 42 ) 


for a suitable transmission matrix T and transition matrix E, and Rq can 
be calculated as the dominant eigenvalue of Kl = —The matrix K, 


Li. 


which can in general have high dimensionality, is referred to in iDiekmann et al. 
(120131 ) as the next-generation matrix with large domain. The authors, how¬ 
ever, explain clearly that one can get a hrst dimensionality reduction by 
focusing only on those states a newly infected individual can start from 
(here, only those states where s = d — 1, the decrease in s being only a 
transition due to the infective making infectious contacts with neighbours). 
Furthermore, a second dimensionality reduction occurs when new cases are 
generated in hxed proportions, so that, stochastically speaking, there is only 
one type of new infective, which starts in each state according to a hxed 
distribution. In our case, any new case has degree d and starts in infectious 
state a with probability VadF {D = d) /E[Z1], so that, in fact, the calculation 
of i?o (and analogously of r) can be reduced to a 1-dimensional equation; 
see flMll and (IMl) . This last simplihcation is the same one we applied in the 
homogeneously mixing case when we noticed that the matrix has rank 1, 
which is the case exactly because new infectives start in each state in hxed 
proportions given by v. 


3 . 3 . 4 . Efficient numerical methods 

We now present a numerically efficient algorithm for the computation of r 
that avoids the costly and potentially numerically unstable matrix inversion. 
To facilitate application, we express it directly in terms of the transition rates 
of the original compartmental model. 

Proposition 1. For a compartmental model as defined above, r is given by 
the largest real solution to the equations 

K-{r + (Ta + Xa + P^ajZair) + ^ (Ta,bZb{r) = 0 , (43) 

67^0. 

m 

d*'^UaZa{r) = 1 . (44) 

a=l 
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Proof. First, define 


Zair) := E 


Ai(t)e dt 


I Jo 


P(0) = la 


(45) 


Then fH5]) follows (after a little algebra) from Proposition 2 of lPollett and Stefanov 
(2002). Combining ([7]) and ([2]), we find that r satishes 


l = d* I E[Ai(t)]e-''*dt 

1 

Ai(f)e-"* dt 

poo 

/ Ai(t)e-'-Mt Y{d) = Ia 


L^o 

m 


= rf*E 
? 

= 

a=l 
m 

= d*'Y^ UaZa 


F[Y{0) = Q 


(46) 


a=l 


which establishes fHTll . The facts that there is a unique such value of r with 
largest real part and that it is real lie in the theory developed above. 


Proposition 2. For homogeneous mixing, is given by the largest real so¬ 
lution to the equations 


K - + (Ta + Fa}z’:{r^) + aa,bz’^{r’^) = 0 , 

b^a 

m 

= 1 . 

a=l 


Proof. Dehne 



E 





P'*(0) 



use fl36l) and follow the analogous reasoning to fj46ll above. 


(47) 

(48) 


(49) 


3.3.5. Examples 

We consider four specihc compartmental structures, for which we explic¬ 
itly state only the non-zero rates, and where the A rates apply to the network 
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model and the r] rates to the homogeneously mixing model. For the hrst three 
of these structures, we do not have different infectivities for infectious states, 
but instead use compartments to model different recovery time distributions. 
There is quite an extensive theory of the ‘phase-type’ distributions that arise 
from use of compartments in this way. In particular, it is known that they are 
dense in the space of positive-valued probability distributions flNeutsl. Il975l) 
and so the present approach can be used, at least in theory, to approximate 
any desired distribution within any required level of accuracy. The fourth 
structure is the standard SEIR model. In general, models with m compart¬ 
ments will lead to m-th order polynomials, with the attendant restrictions 
on what can be said analytically. Nevertheless, as indicated above the values 
of r can be calculated numerically using an efficient algorithm. 

Markovian dynamics. These have already been considered above; however, 
for completeness we note that these would be defined through 


1/1 = 1, /ii = 7 , Ai = A and 71 = 7 , 


(50) 


and we obtain 


r,h_V 
Rn = d* 


A 


rph _ 

9 ~ ^ 

T = _ - _ 

^ 7 + A 


= rj 


7 


7 A ’ 

in agreement with what shown in Section 13.2.11 


r = (d*-l)A-7, 


Hypo-exponential. The hypo-exponential distribution has less variability than 
the exponential and consists of a chain of phases. For example, consider the 
special case of two equally infective compartments in series 


— 1 , O'lg — 7l ; h2 — 72 , 
In this case, we obtain 


Ai = A 2 = A and 71 = 72 = 7 • (51) 


Ro=V 

1 


1 1 
7i 72 


1 1 1 

T„ —- 1 - 

^ 7i 72 7i + 72 


r'* = - (7 - (71 + 72) + \/(71 - 72)^ + 7^ + 2(71 + 72)7) , 
7172 ^ „ 1 


Ra = d* [I- 


T = 

(A-F 7 i)(A- h 72)7 ’ ^ A-F7 i \ + 'j 2 A71-^72’ 

^ = ^ ((d* - 2)A - (71 72 ) V"(d*A -h 72 - 7 i)^ + 4 d*A 7 ij , 


+ 
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which, in the special case of 71 = 72 = 7 leads to the same result that is 
derived from Section 13 .2 .21 when I is Erlang-distributed with a = 2 and scale 
parameter 7 . 


Hyper-exponential. The hyper-exponential distribution has more variability 
than the exponential and consists of a parallel set of phases. For example, 
consider the special case of two equally infective compartments in parallel, 
with different sojourn times 


1^1 = 1^2 = 1/2 , ^ 7 i , ^2 ^ 72 , Ai ^ A2 ^ A and r/i = r}2 = rj . 

(52) 

we obtain 

= + = — + —- - — , 

2 V7i 72/ ^ 7i 72 7 i+72 

j (7 - (71+72) + y(71 - 72)^+ ’ f ), 

^ ,. A(A + A(7i+72)) ^ 1 , _1_ 2 

° (A7i)(A 72) ’ ^ A-h7i A-h72 2 A 7172 ’ 

r = ^ (^{d* - 2 )A - (71 + 72) + V(^*A )2 + (71 -72)2^ . 

Note that in this case the transition matrix is reducible (see Lemma [T]). 
SEIR. The standard SEIR model obeys 


z/i = 1 , (Ti ,2 = d , /n 2 = 7 , A 2 = A and 72 = , (53) 


where S represents the rate of transition from the latent to the infectious 
state and 7 the recovery rate. We obtain 


= — 
2 


Rn = d* 




[V (7 - + 47^ - (7 + ^)) , 

7 -I-A’ ^ ( 5 ~*~A-|- 7 ’ 

^a/(A -F 7 5)2 -k A5{{d* - 1)A - 7 ) - (A + 7 5) j 
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Note that combining the hrst two equations for the homogeneously, one ob¬ 
tains the known expression relating Rq and in the SEIR model, 



(54) 


4. A general approach via direct calculation 

In this section we examine a general approach to the calculation of the 
real-time growth rate r, by direct calculation of the distribution of the time 
taken for infection to pass across an edge, Wi. Once the distribution of Wi 
and its pdf Wiit) have been determined we can easily calculate Tg = E[iyi] 
and /9(t) = d*E[l — e“"^] tci(t), leading to the real time growth rate r via the 
Lotka-Euler equation ([9]). 

Let O be the sample space of the infectivity prohle distribution and let 
A = {A(t) : f > 0} C O be the family of random variables representing an in¬ 
fectivity prohle drawn from this distribution. Recall that infectious contacts 
are made between an infected individual and each of their neighbours at the 
points of an inhomogeneous Poisson process with rate A(t). Let M represent 
the total number of contacts made across an edge, and hence M ~ Pois(A), 
where A = A(t) dt. Recall that hP is a random variable representing 
the times at which infectious contacts are made and Wi represents the time 
of the hrst of these infectious contacts. Clearly these two random variables 
are conditional on at least one infectious contact occurring at some point, 
i.e. M > 0. 

Proposition 3. The equation below links the cdf of Wi to the cdf of W given 


A. 


¥{Wi < x\M > 0) = 1 — 


Iq-AP(W<x\M>0,A) _ g-A' 


Ea[ 1 - e-^] 


where A = A{t) dt. 


Proof. 


P(IPi < x\M > 0) 


F(Wi < X, M > 0) 
P(M > 0) 
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Conditioning on the joint distribution of M and A yields 


F(Wi < x\M > 0) 


In <x\M = m, A) dP(M = m, A) 

P(M > 0) 

Ea[E“=i = m, A)P(M = m\A)] 

P(M > 0) 


Given M = m > 0, the hrst contact time Wi is the minimum of m iid 
realisations of W, and so P(iyi < x\M = m, A) = 1 — P(hh > x\M > 0, A)™'. 


p(iyi < x\M > 0) 


EaE“^i( 1 -P(hh > x\M > 0,A)™)P(M = m|A)] 
P(M > 0) 

EAE^^iP(hh > a;|M > 0, A)"^P(M = m|A)] 
Ea[P(M > 0|A)] 


Finally recall that MIA ~ Pois(A) and hence ETz^IAI = e and 

P(M = 0|A) = e-^. 


If the cdf of W is known then Proposition [3] can be used to determine the 
cdf of IFi. 


4-1. Exponential total infectivity 

In this section we consider models in which the total infectivity A = 
A{t) dt is exponentially distributed. Important examples in this class of 
models includes the sSIR model with exponential infectious periods, the TVI 
model where A{t) = Af{t), where A is exponentially distributed and / is any 
proper pdf. 


Proposition 4. If A is exponentially distributed then: 

1. M ~ Geom(p), where p = 1/(E[A] + 1) and P(M = m) = p(l — p)^ 
for m = 0 , 1 ,.... 

2. M|(M > 0) ~ 1 + Geom(p). 

3. The cdfofWi is given by 


F(Wi < x) 


p¥{W > x) 

1 — (1 — p)P(IF > x) 


(55) 


Proof. 1. M\A ~ Pois(A) and removing the conditioning yields P(M = 
m) = p{l — p)^. 
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2. 

3. 


This follows directly from the lack-of-memory property of the geometric 
distribution. 


Marshall and OlkinI fll997l ) show that, given any iid sequence of random 


variables Xi, X 2 ,... with cdf Fx{x) and iV ~ 1 + Geom(p), the cdf of 
U = min{Xi,.. .,Xn} satishes 1 - Fu{x) = • By 2, 

Wi is the minimum of a geometrically distributed number of random 
variables, distributed according to W. 


Marshall and Olkin f 1997fl considers families of random variables F in 


which the minimum of a random number N of iid variables is also within 
the family F. The authors show that in such cases N must be geometrically 
distributed and the family is characterised by fISBD with 0 < p < 1. 


5. Numerical method 

It is possible to calculate the real-time growth rate r numerically for 
the most general model described in this paper by following the procedure 
detailed below. First we use Monte Carlo sampling with Poisson thinning to 
obtain an estimate for E[Ai(t)] and hence /3{t) = (i*E[Ai(t)]; then we solve 
Equation ([9]) iteratively for r. 

We begin by choosing a grid of G values, ti < t 2 < ■ ■ ■ < to, over which 
we will apply a quadrature method to compute the integral in ([9]) (e.g. the 
trapezium rule). It is important that the grid covers the range of t for which 
the integrand has a signihcant contribution to the integral. Finding a suitable 
value for tc is usually not a problem because, since /Q°°E[Ai(t)] dt is hnite, 
then E[Ai(t)] tends to zero as t —)■ cxd and, in the most interesting case of 
r > 0, so does e“'’h 

Next we simulate M samples from the infectious contact distribution. For 
the Ath sample t > 0}, hrst calculate ruj = sup{A(d(t) : t > 0} 

and then simulate events from a homogeneous Poisson process with rate 
rrii, for example using the Gillespie algorithm. This algorithm begins at 
time 0, and the time until the next event is drawn sequentially from an 
exponential distribution with rate m*. To obtain a sample of events from 
the inhomogeneous Poisson process with rate A^*^(t), we accept event j at 
time Tj in the homogeneous Poisson process with probability A*^®^(rj)/mj, 
and reject the event otherwise. Let denote the time of the hrst accepted 
event, that is the time of the hrst event in the inhomogeneous Poisson process 
with rate For more details regarding this Poisson thinning procedure 
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see 


Kingmaiil ( 1992 ). We need only to continue simulating events in the 


homogeneous Poisson process until either the hrst event is accepted or the 
time of the hrst accepted event is known to be after the last of our grid 
points, to- Finally to complete the calculation of E[Ai(f)], we use the fact 
that 


E[Ai(f)] = E[A(f) P(no event in (0,f)|A)] 

= E[A(f) E[l{no event in (0,t)}|A]] 

M 

i=l 

and hence that 

M 

i=l 

Note that we can use the same set of simulated A*^d to calculate 

the integrand in ([9]) for all of our grid points ti,... ,tG and therefore the 
computation time needed for the hrst part of the procedure does not depend 
on the number of grid points G. 

The second part of the algorithm involves the use any iterative method 
(e.g. the secant method; we used MATLAB® built-in function fzero) cou¬ 
pled with any quadrature method (e.g. the trapezium rule) to hnd r from 
Equation ([9]). 


6. Numerical results 

In this section we explore the key epidemiological quantities in both the 
homogeneously mixing and the network models, for various choices of the 
network degree distribution and the infectivity prohle. When a TVI model 
is used we assume a gamma-shaped infectivity prohle, i.e. proportional to 
the pdf of a gamma distribution, where the shape parameter a takes values 
of 0.5, 1, 2, 10 and the limiting case of a —)■ oo, which corresponds to the 
Reed-Frost model where all the infectivity is spread at time Tg = Tg. When 
a sSIR model is assumed, the duration / of the infectious period is assumed 
to follow a gamma distribution, again with shape parameter a = 0.5,1, 2,10 
and the limiting case of a —)■ oo, which corresponds to a hxed duration 
infectious period. The choices of a in both cases, correspond to the gamma 
distributions where the ratio between the variance and the mean is 2, 1, 1/2, 
1/10 and 0, respectively. 
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6.1. Network model outputs 

In Figures m and O we plot three network-related outputs - Tg (left col¬ 
umn), i?o (middle column) and r (right column) - as a function of the basic 
reproduction number for the corresponding homogeneously mixing model 
(see section [277)1 . Figure 0] examines the TVI model (with hxed Rq and Tg) 
and Figure 0] examines the sSIR model (at hxed and E[J]) for different 
network degree distributions: regular (top row), Poisson (middle row) and 
negative binomial with variance 5 times larger than the mean (bottom row). 
Together with r, we also show the approximation described in Section 12.61 
for each infectivity prohle. 

Note how the value of Rq unequivocally determines the value of Rq when 
the infectivity prohle is non-random (Figure 01 central column), but not 
when it is random (Figure |5l central column). Other observations are: Tg 
and Tg are always identical in the Reed-Frost model; increasing the infec¬ 
tivity leads to shorter generation times on the network and larger values of 
the real-time growth rate (with the exception of small infectivities for very 
overdispersed durations of infection in the sSIR model, e.g. when a < 1). 
Finally, the strongest ehects of infection interval contraction and repeated 
contacts, i.e. shorter Tg, lower Rq and r, and therefore also a less accurate 
approximation, are experienced when d* is smallest: at hxed mean degree, 
this is the case of the regular network (zero variance); at hxed distribution, 
for smallest mean degree (Figure E]). With the latter in mind, we chose here 
a mean degree of E[T)] = 3, which is large enough to allow the existence 
of a giant connected component, but small enough to accentuate saturation 
ehects. 

The impact of the network structure is highlighted in Figure El where 
four networks with diherent degree distributions (regular, Poisson, geomet¬ 
ric and negative binomial with variance 5 times larger than the mean) are 
investigated. The real-time growth rate was plotted as a function of the mean 
degree E[T)]. Except for the geometric degree distribution, for all other net¬ 
works the ratio Var {D) /E[T)] is constant and the magnitude of this constant 
is rehected, for any chosen infectivity prohle, in the clear ordering of r from 
regular to Poisson to negative binomial. The comparison between diherent 
networks and infectivity prohles is performed as follows. All infectivity pro- 
hles are from the sSIR model and have the same value of E[J]. We hxed 

= 0.5, and from this the corresponding values of Rq and E[y4] were ob¬ 
tained, which were then used to compute r. Note how all network models 
converge to the homogeneously mixing model in the limit of large mean de- 
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Figure 4: The network-related outputs (the generation time Tg, first column; the basic 
reproduction number Rq, second column; and the real-time growth rate r, together with 
its approximation Tapp, third column) as a function of the basic reproduction number of 
the corresponding homogeneously mixing model, Rq, for the TVI model with gamma¬ 
shaped infectivity profiles having shape parameter a = 0.5,1, 2,10 and oo (the Reed-Frost 
model). Each row considers a different degree distribution: regular (Var (Zl) = 0), first 
row; Poisson (Var (D) = E[Z1]), second row; and negative binomial (for which we imposed 
Vai (D) = 5E[I1]), third row. All networks have mean degree E[I1] = 3, a reasonably 
small value to accentuate the network saturation effects on disease spread. All infectivity 
profiles match the same value of Tg = 1. The black vertical dotted line indicates the value 
of Rq for which i?o = 1, which depends on the degree distribution. The horizontal black 
dotted line shows where = 1 (middle column) or r = 0 (right column). 
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Figure 5: Network-related quantities (the generation time Tg, first column; the basic 
reproduction number i?o, second column; and the real-time growth rate r, together with 
its approximation rapp, third column) as a function of the basic reproduction number of 
the corresponding homogeneously mixing model, i?Q, for the sSIR model with gamma¬ 
shaped infectious period distributions having shape parameter a = 0.5,1,2,10 and oo 
(i.e. constant infectious period). Each row considers a different degree distribution: regular 
(Var {D) = 0), first row; Poisson (Var {D) — E[Z1]), second row; and negative binomial (in 
which Var (H) = 5E[I1]), third row. All infectivity profiles have infectious periods with 
E[/] = 1 and no latent period. All networks have mean degree E[Z1] = 3, a reasonably small 
value to accentuate the network saturation effects on disease spread. The vertical dotted 
lines indicate the value of i?Q for which Rq = \ for each infectivity profile (corresponding 
shades of grey). The horizontal black dotted line shows where i?o = 1 (middle column) or 
r = 0 (right column). 
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gree. Finally, if we were to plot r as a function of Rq (not shown) then the 
choice of the degree distribution would be irrelevant, because the only im¬ 
pact of D is in the factor d*, which appears identically in the computation 
of both Ro and r. In other words, the relationship between Rq and r given 
by Equation ffTOj) only depends on Wi (i.e. on wi{t)), which is affected by 
the choice of the infectivity prohle, but not by the overall mean infectivity 
or network structure. 

6.2. Model comparisons 

Figure [7] highlights the importance of being unambiguous about what is 
kept hxed when a model comparison is performed. Even in the absence of 
network structure. Figures [7^ and [7 )d show how, for the sSIR model with 
Rq > 1, can increase or decrease as the variance of the infectious period 
duration I decreases (the shape parameter a increases) when either E[J] or 
Tg are kept hxed, respectively. On the other hand. Figures [7t and[71i show 
how, for the sSIR model on a network (regular with degree 3 in this example), 
r can diverge as Rq approaches d* or not, depending of whether E[/] or Tg is 
kept hxed, respectively. The approximation described in Section 12.61 is also 
shown, and it is found to be particularly inaccurate because this example 
tests it at the limit when ehects of repeated contacts and infection interval 
contraction are strongest. 

The intuitive reason why r diverges when E[J] is kept hxed (Figure [Tt) 
is that, as Rq ^ d*, the infection rate A cxo, which means that the time 
at which the hrst infectious contact occurs shrinks to 0 and thus r —)■ cx3. 
Because the approximation ignores precisely this aspect (the time of the 
hrst infectious contact is approximate with the time of a randomly selected 
contact), unlike r, the approximation r^pp does not diverge. Furthermore, as 
A —)■ cxD the generation time Tg shrinks to 0. Therefore if Tg is kept hxed 
instead of Tg (as in Figure [T]!), E[J] and Tg diverge, but the resulting r 
does not (because the time of the hrst infectious contact does not shrink 
to 0). The approximation Tapp even decreases because the average time of 
an infectious contact diverges. The same phenomenon of r exploding as in 
Figure [7t tends to occur also for the TVI model (not shown) although as the 
shape of the infectivity prohle becomes more peaked around its mode, the 
infection rate just after infection decreases. This prevents Tg from getting 
too close to 0. In the limiting case of the Reed-Frost model Tg = Tg, and 
therefore r = Tapp and neither of them diverge (nor decrease) when Rq ^ d*. 
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Mean degree, E[D\ 


Figure 6: Impact of the network degree distribution: r as a function of the mean degree 
for fixed = 0.5, using the sSIR model with gamma distributed infectious periods having 
shape parameter a = 0.5,1, 2,10 and oo (i.e. constant duration) and four different degree 
distributions: regular (Var (£)) = 0, continuous line), Poisson (Var (£)) = E[ZI], dashed), 
geometric (Var {D) = E[D] (1+E[D]), dash-dotted) and negative binomial with Var (D) = 
5E[£)] (dotted). All infectivity profiles share E[/] = 1 and = 0.5. The horizontal thick 
black and grey lines show r = 0.5 and r = 0. For increasing mean degree all network 
models converge to the homogeneously mixing model and the smaller mean degrees are 
associated with stronger network saturation effects and hence slower epidemic spread. 


39 






a) 


b) 




3 


sSIR Gamma, a 

= 0.5 

sSiH Gamma, a 

= 1 

sSiR Gamma, a 

= 2 

- sSiH Gamma, a 

= 10 

- sSIR Const 


-Approximate r 



1.5 



'0 0.2 0.4 0.6 0.8 1 1.2 1.4 1.6 1.8 2 


i?0 


i?0 


Figure 7: Illustration of which quantities to keep fixed when the impact of different in- 
fectivity profiles is compared. First row: relationship between and Rq in a purely 
homogeneously mixing population and sSIR infectivity profile with gamma distributed 
duration of the infectious period I having shape parameter a = 0.5,1, 2,10 and oo (con¬ 
stant duration). In (a) E[/] = 1 and in (b) Tg = 1 from Equation (fT4|) are kept fixed. 
Note the opposite monotonic relationships between and Rq when Rq > 1 in the two 
cases. Second row: relationship between r and Rq on a regular network with degree D = Z 
for sSIR infectivity profiles with gamma distributed duration of the infectious period I 
having shape parameter a = 0.5,1,2,10 and cx) (constant duration). In (c) E[/] = 1 and 
in (d) Tg = 1 is kept fixed. Note how, as Rq ^ 2, r diverges in (c) but not in (d); the 
approximation rapp does not diverge in (c) and even shrinks in (d). 
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6 .3. Estimating Ro from r 

Above we have explored how the real-time growth rate and the basic 
reproduction number change as functions of the model parameters. However, 
r is often one of the few readily available quantities that can be measured 
from attainable data. Therefore, it is interesting to explore how our estimates 
of Ro based on observed values of r are affected by the model structure. 
More specihcally, assume we observe an exponentially growing epidemic and 
that we somehow obtain a measurement of the real-time growth rate, which 
we denote hereafter by f. Assume also that the social contact structure is 
properly represented by a locally tree-like network (i.e. our network model 
represents ‘the truth’, which may admittedly not be realistic). We may 
then ask: how is ability to estimate Ro affected by the potential lack of 
information about such contact structure? And how is it further reduced if 
in addition we also have no information about the infectivity prohle except 
for the generation time? Figure |H] explores exactly these issues, for a regular 
network with degree 3. 

In the top and bottom rows of Figure [8] we consider the TVI and sSIR 
models, respectively. In the left column we assume f is observed and infor¬ 
mation about the infectivity prohle is available by accurate observation of 
infected cases. Denote by Ro{r) the relationship between r and Rq on the 
network and by R!^{r^) the relationship in a homogeneously mixing popu¬ 
lation. Then Figures |8^ and [HI:; show the ‘true’ Ro{r) (i.e. on the network) 
associated to the observed exponential growth and the i?o(f) that would 
be estimated from = f in a homogeneously mixing population with the 
same infectivity prohle: in [H^ the TVI prohles share the same Tjf, while in 
jHt the sSIR prohles all have the same E[J]. In the right column we take a 
diherent approach: in both cases we know the type of infectivity prohle and 
its variability (TVI with a parameter for the shape in [8 ]d and sSIR with a 
parameter for the distribution of I in jH]!) , and in addition we assume we can 
measure the generation time intervals, which we denote by Tg, by observing 
the times at which new cases arise as the epidemic unfolds. Let us denote by 
Ro{r,Tg) the relationship between Ro, real-time growth rate and generation 
time on the network and by RQ{r^,Tg) the corresponding relationship in a 
homogeneously mixing population. In Figures |8 ]d and |8li we then hrst plot 
the ‘true’ Ro{r, Tg) associated to the observed exponential growth and gener¬ 
ation time, and we compare it with the Rq (r, Tg) that would be estimated in 
a homogeneously mixing population where cases would grow exponentially 
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Figure 8: i?o as a function of r on a regular network with degree D = 3. First row: the 
TVI model is assumed, having gamma shaped infectivity profiles with shape parameters 
a = 0.5,1, 2, 10 and oo. In (a) is kept fixed and in (b) we impose Tg = Tg (see main 
text). Second row: the sSIR model is assumed, having gamma distributed duration of 
the infectious period / with shape parameter a = 0.5,1, 2, 10 and oo. In (c) E[/] is kept 
fixed and in (d) we impose Tg = Tg (see main text). The dashed lines represent the 
homogeneously mixing model limit for each infectivity profile, where r = and Rq = Rq. 
The black dotted lines show where r = 0 (vertical) and i?o = 1 (horizontal). 
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at a rate = r with generation time = Tg (note that here we are fixing 
Tg also for the sSIR model, rather than IE[/]). Although our methodology 
is also valid when r < 0, the setup in this section is meaningful only when a 
large epidemic is being observed. Therefore, we restrict our attention to the 
case of r > 0. 

The first observation from Figure |H] is that, at fixed infectivity profile 
(and for r > 0), the estimates of Rq obtained by ignoring the social structure 
are always conservative, i.e. Rn is alway s larg er than R n. This result i s 
in line with the findings of iKenahl (1201 ll) and iBall et ahl (jin preparation! ) . 
The latter study, in particular, is more general as it considers additional 
social structures, namely multitype and households models. However, in the 
present work we examine more extensively the impact of different infectivity 
profiles. 

Assume now that information about the full infectivity profile is not avail¬ 
able: then from Figure [8^ we know that, if we assume a TVI model and we 
only know r and Tg, then ignoring the social structure and assuming a Reed- 
Frost model with latent period of length leads to the most conservative 
estimate about Rq. The discrepancy between Rq and Rq can potentially be 
quite large, especially for large r. However, in realistic settings the value of 
d* can be significantly larger than the value d* = 2 chosen here to accentuate 
network effects, leading to a much smaller discrepancy. On the other hand, if 
the sSIR model is assumed and only r and E[/] are known, no obvious upper 
bound is seen in Figure IHt, as overdispersed distributions for / can make the 
difference between Rq and Rq arbitrarily large. However, as shown in Figure 
[7] (top line), if Tg is kept fixed instead of E[J] the monotonic relationship in 
r is reversed and the constant duration of the infectious period would then 
lead to the most conservative estimates for Rq. Furthermore, Rq is allowed 
to grow unbounded because of the lack of a latent period: if such a period 
were added, then a combination of a random latent period followed by an 
infectious period concentrated in a single point in time, would lead to the 
most conservative estimate of Rq (as is the case for the Reed-Frost model in 
Figure [Sti-)- 

Finally, if we were to decide which observation scheme to adopt between 
measuring Tg (or E[/]) by carefully studying the shedding profile of an in¬ 
fective or measuring Tg by observing the time interval of cases during an 
unfolding epidemic, then the latter would lead to estimates of RQ{f,Tg) ob¬ 
tained by ignoring the social structure altogether that are much closer to the 
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“true” RQ{r,Tg). This is true both when the full details of the infectivity 
prohle are known and when no information about it is available except for 
the estimated Tg (in particular in Figure [Hi). However, we do recognise that 
measuring Tg from data might not be a straightforward task (even ignor- 
ing realistic logistic constr aints), because of potential observation biases (see 


Scalia Tomba et ah . 2010 [ for a thorough discussion of such biases). 


6.3.1. Reproduction numbers and pair approximation models 

It is worth mentioning that, in addition to Rq, various other reproduc¬ 
tion numbers have been dehned in t he literature for e pidem ics sp reading in 


popul ation with a social structure. [Goldstein et ah! fl2009h and [Ball et al 


f[2014j) analysed the properties of numerous such quantities in the con t ext o f 
epidemic models with two levels of mixing, while [House and Keelind ( 201111 
used pair approximation methods to analyse four thresholds for Markovian 
epidemic dynamics on regular networks (i.e. where each individual has the 
same number of neighbours) with signihcant levels of clustering, i.e. in the 
presence of short loops like those in population P 3 in Figure [H While some 
of these reproduction numbers are beyond the scope of our current paper, 
we note that one is particularly relevant for a locally tree-like network struc¬ 
ture {P 2 in Figure [J): the exponen t ial-gro wth associated reprodu ction num¬ 


ber, called Rr by Goldstein et al.[ (120091 1 and [Ball et al.[ (120141 1 and tq by 


House and Keelind (j201l[) . This is dehned as the reproduction number one 


would infer from knowledge of the contact interval distribution w{t) and the 
early exponential growth rate r, assuming the network structure is ignored. 


I.e. 


Rr '.= 


frui(t')e-'‘dt' 


(56) 


Note that Rr is dehned as a network quantity that aims at approximating Rq, 
more than the result of a comparison between a network and a homogeneously 
mixing model. However, if we assume that r = f is observed, then Rr = 
R^if) exactly as discussed in the previous section when the two models are 
compared at hxed contact interval distribution {w = w^). Therefore the 
relationship between Rq and Rr is fully described by Figures [Hti- and[Ht. 

The context is conceptually diherent when r is not observed but calcu¬ 
lated exactly from the basic parameters of the network model. This is the 
typical case of pair approximation models on networks. Because the focus is 
usually on tracking the epidemic dynamics using a small system of ODEs, 
these models usually assume a constant recovery rate 7 (i.e. exponentially 
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distributed duration of infection). The real-time growth rate r is obtained by 
linearising the system of ODEs for the number of pairs of nodes in different 
states and is given by r = A(d* — 1) — 7 , in agreement with Section I3.2.1I 
Then, from the dehnition of w{t) and Equation fl56|l . 


Rr = l + - = (d* -1)- . 
7 7 


(57) 


This is a well-dehned reproduction number, which shares the same threshold 
at 1 with Rq (dehned in Section I2.3p and whose expression might sometimes 
be preferable to that of Rq. However, it must be noted that Rr does not 
respect the standard verbal dehnition of the basic reproduction number as 
the average number of secondary cases infected by a typical case in a large and 
almost fully susceptible population (i.e. when the depletion of susceptibles 
can be neglected and the number of cases is growing exponentially). In 
particular, R^- is not bounded by the average number d* of neighbours of a 
typ ical new infective, as r ha s no upper bound. 

House and Keeling f 201l[l showed, in the case of Markovian dynamics on 


regular networks, that Rq as dehned in Section 12.31 can be recovered from 
diherential equation models where variables are added allowing the numbers 
infected in diherent infectious generations to be explicitly counted. Lin¬ 
earisation of this extended dynamical system around the disease-free equi¬ 
librium yielded a value of Rq equ al to that dehned in Section 12.31 and we 
conjecture, based on the results of Barbour and Reinert ( 20131) . that a sim¬ 
ilar augmented dynamical systems analysis of non-Markovian dynamics on 
conhguration model networks with heterogeneous degree distributions would 
yield our more general expression for Rq. 


7. Conclusions 

We have presented a comprehensive description of analytical and nu¬ 
merical computation of the real-time growth rate for Markovian and non- 
Markovian models of infection spread on unclustered networks, providing 
explicit calculations in many important special cases. Although the strong 
assumption of no clustering represents the main limitation of the present 
work, exact temporal dynamics in the presence of clustering appear to be 
theoretically intractable and very few exact analytical results are currently 
available. 
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In the comparison between homogeneously mixing and network models, 
we have highlighted here how there is a strong need to be clear about how 
the comparison is performed, as different choices on which quantities are 
kept hxed in the comparison can lead to very different conclusions about 
the impact of networks. However, in general we can conclude that, for any 
chosen infectivity prohle, at equal mean total infection pressure of a typical 
infective {Rq = d*E[y4]) or at equal mean total number of new cases generated 
by a typical infective {Rq = — e"^] = Rq), the presence of a network 

results in faster spread compared to a homogeneously mixing model. The 
fundamental reason is what we called infection interval contraction: in the 
presence of repeated infectious contacts towards the same susceptible, the 
actual infection occurs at the time of the hrst infectious contact, which is 
always stochastically smaller than the time of a randomly selected infectious 
contact. This also implies that, at hxed observed r, neglecting the network 
structure l eads to overes timating the value of Rq. This has al ready been 
observed in Kenahl f 201ll) and a formal proof of it can be found in ( Ball et al 


in preparatiorJl . 


We have mostly considered conditions where the effect of infection inter¬ 
val contractions is exacerbated by low mean degree, high infectivity and/or 
large variability or temporal spread of the infectious period. Some of these 
conditions occur in contexts such as that of sexually transmitted infections 
(many individuals are monogamous or nearly-monogamous and long infec¬ 
tivity prohles - e.g. for HIV - imply many repeated contacts with the same 
partner) and this reinforces the standard trend of using network models in 
this research area. However, in most other realistic epidemic systems with 
a large and reasonably dispersed degree distribution and realistic infectivity 
prohles, the impact of infection interval contractions is often mild (if de¬ 
tectable at all). This suggests that, for example in the context of airborne 
infections, detecting the presence of a network social structure by only look¬ 
ing at aggregate epidemiological measures such as Rq and r is unlikely, but 
also that the overestimate of Rq from observed r obtained neglecting the 
social structure can in general be quite accurate. The latter is fundamen¬ 
tally the reason why, in pair approximation models, the exponential-growth 
associated reproduction number R^ is often numerically very similar to Rq. 
Furthermore, the same reason e xplai ns wh y the appro x imati on described in 
Section [2T] adopted by Fraserl ( 2007 1 and Pellis et al. ( 2010 1 in the case of 
inhuenza lead to reasonably accurate approximations of r or good estimates 
of Rq and the household reproduction number from r. 
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